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ABSTRACT 

Consensus  algorithms  permit  the  computation  of  global  statis¬ 
tics  via  local  communications  and  without  centralized  control. 
We  extend  previous  results  by  taking  into  account  fading  and 
unidirectional  links  in  ring  and  random  2-D  topologies.  We  study 
conditions  for  convergence  and  present  simulation  results  to  verify 
the  analytical  results  in  this  paper.  We  compare  the  performance 
of  consensus  algorithms  with  a  tree-based  (centralized)  approach. 
Additionally,  we  implement  a  slotted  ALOHA  protocol  and  com¬ 
pare  its  performance  to  that  under  the  initial  assumption  of  perfect 
scheduling. 


I.  INTRODUCTION 

In  the  classical  distributed  estimation  problem,  sensors  make 
noisy  observations  of  a  scalar.  The  (weighted)  average  of  these 
observations  is  a  good  estimate  under  many  scenarios;  this  average 
may  be  computed  by  nodes  sending  their  estimates  to  a  fusion  cen¬ 
ter  (the  centralized  version),  or  by  exchange  of  information  between 
nodes.  The  basic  consensus  problem  addresses  the  convergence 
of  such  schemes.  Centralized  approaches  require  the  computation 
of  a  tree,  rooted  at  the  fusion  node,  aggregation  of  data  up  the 
tree,  and  dissemination  of  the  consensus  value  down  the  tree.  See 
[1]  for  a  discussion  of  issues  and  approaches  related  to  scalable 
aggregation  and  scalable  inference.  Re-computation  of  the  tree 
would  be  required  under  mobility,  duty-cycling  and  fading.  The 
role  of  the  fusion  node  may  need  to  be  rotated  in  order  to  avoid 
single  points  of  failure,  and  depletion  of  resources  at  the  nodes 
neighboring  the  fusion  node.  Decentralized  approaches  may  be 
preferable  under  these  circumstances. 

The  distributed  consensus  problem  is  also  of  interest  in  several 
military  contexts.  Wireless  networks  are  becoming  more  prevalent 
in  tactical  scenarios.  Further,  decentralized  algorithms  are  desired 
so  that  resource  requirements  are  distributed  throughout  the  net¬ 
work.  Olfati-Saber  et  al  [3]  briefly  describe  several  military-relevant 
applications  that  distributed  consensus  would  improve.  Flocking 
theory  and  formation  control  techniques  are  applied  to  unmanned 
vehicle  platforms  for  obstacle  avoidance  and  waypoint  finding 
tasks.  With  consensus,  each  agent  or  node  would  make  heading 
or  velocity  updates  without  centralized  control.  Consensus  in  dis¬ 
tributed  sensing  environments  would  decentralize  the  requirements 
of  communication  and  computational  costs.  And  consensus  has 
been  used  as  a  mechanism  for  decentralized  network-wide  time 
synchronization. 

Detailed  discussion  of  the  state-of-the-art  may  be  found  in 
Section  III.  In  the  basic  setup,  nodes  i  —  1,  •  •  •  ,n,  have  initial 
estimates  or  measurements  and  the  problem  is  to  obtain  a 

common  weighted  average  aiXi{0)  via  decentralized  exchange 
of  messages.  Implicit  is  the  notion  of  local  exchanges  in  a  one- 
hop  neighborhood  (that,  for  example,  may  be  defined  by  an  SNR 
threshold  criterion  on  the  received  signal).  In  the  literature  the 
analysis  is  often  carried  out  by  assuming  that  the  underlying 
connectivity  matrix,  or  the  neighborhood  set  is  static,  and,  for 
wireless  applications,  a  random  geometric  graph  is  assumed  [2]. 
Similar  to  [6],  we  consider  a  broadcast  update,  but  unlike  it. 


we  explicitly  consider  a  fading  channel  and  take  into  account 
collisions.  Coupled  with  an  SNR  threshold  criterion,  this  leads  to 
a  random  neighborhood  set.  Under  this  model,  the  connectivity 
graph  is  not  symmetric  in  any  given  snapshot  (node  i  may  reliably 
receive  node  j’s  estimate  but  not  vice  versa),  and  the  consensus 
algorithm  is  not  sum-preserving.  We  consider  the  convergence  of 
the  consensus  algorithm  under  various  assumptions  on  the  fading. 
We  also  investigate  an  Aloha  type  random  access  scheme  and 
compare  performance  with  tree-based  approaches. 

Our  main  contributions  are  in  studying  the  effect  of  fading  and 
collisions  on  the  performance  of  wireless  consensus  gossiping  and 
in  comparing  its  cost  (measured  in  terms  of  number  of  transmitted 
packets  or  convergence  time)  with  that  of  a  tree-based  approach. 

II.  MODEL  AND  ASSUMPTIONS 

We  consider  a  network  of  n  nodes  with  locations  denoted  by  Vi, 
i  =  1,  We  consider  fading,  path  loss  and  additive  noise.  The 
fading  process,  g{t),  is  modeled  as  complex  circularly  symmetric 
noise  with  zero  mean  and  variance  (jg  =  1.  Let  Si(t)  denote  the  unit 
amplitude  signal  transmitted  by  node  i,  and  Po  the  fixed  transmitted 
power.  Then  the  signal  received  at  node  k  from  node  i  in  time  slot 
t  can  be  written  as 


yk{t)  —  QkiO 


Si{t)  +  Vkit) 


where  gkiit)  denotes  the  fading  coefficient  associated  with  the 
transmission  from  node  i  to  node  k,  Vk{t)  is  AWGN  with  common 
variance  a^,  dik  =  dki  is  the  distance  between  nodes  i  and  k,  and 
a  denotes  the  path  loss  exponent. 

We  assume  that  a  transmission  can  be  successfully  decoded  if  the 
SNR  exceeds  a  threshold  SNRt.  Let  do  denote  the  range  below 
which  this  threshold  SNR  is  exceeded  in  a  non-fading  channel,  i.e.. 


SNRt  = 


Po 


In  the  presence  of  fading,  the  effective  transmission  range  Vg,  i.e., 
the  range  at  which  the  SNR  threshold  is  exceeded,  is  random,  due 
to  the  randomness  of  the  fading  g{-).  Now,  node  i  can  decode  node 
j’s  transmission  successfully  if 


SNRig  >  SNRt, 


i.e.,  if  \gij \^d^°‘  >  do°‘  ■  When  g{t)  is  complex  Gaussian  (Rayleigh 
fading)  \g'\^  is  exponentially  distributed  with  parameter  1,  the 
probability  that  node  j  can  successfully  talk  to  node  i  is 

Pij  =  Pr(  SNRig  >  SNRt)  =  j 


Note  that  this  does  not  take  collisions  into  account. 

Example  1:  Ring  Topology.  If  n  nodes  are  uniformly  distributed 
on  a  ring  with  radius  r,  then 


dij  =  2r  sin 


(^) 
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Further,  with  rrio  =  J  where  do  is  the  nominal 

transmission  range, 

Pr(  SNRi^  >  SNRt)  =  exp  |-  f  |  . 

\  sin(7rmo/n)  J  J 

For  a  given  fade  value.  Ip],  the  effective  range  is  given  implicitly 
by 

m  =  arg maxdofe, s.t.  |g|^ 
k 

II-A.  The  basic  consensus  algorithm 

At  the  end  of  one  round  of  transmissions,  all  nodes  will  update 
their  estimates,  as 

Xi{t  +  1)  =  [1  -  mi{t)h]xi{t)  +  h  ^  Xj{t)  (1) 

where  t  is  discrete  time,  h  is  a  coupling  parameter,  Miit)  is  the 
set  of  nodes  that  successfully  transmit  to  node  i  in  round  t,  and 
mi{t)  =  |A/i(f)|  is  the  in-degree  of  node  i.  The  parameter  h  is  a 
measure  of  stubbornness  or  stiffness:  Smaller  values  of  h  reflect 
less  confidence  in  neighbors'  estimates  and  lead  to  slow  network 
convergence.  On  the  other  hand,  large  values  can  lead  to  oscillations 
and  lack  of  convergence.  We  can  allow  the  stubbornness  factor  to 
be  dependent  on  both  the  transmitting  and  receiving  nodes,  hij\ 

Xi{t  +  1)  =  [1  -  fx  ^  hij]xi{t)  +  fx  ^  hijXj{t) 

which  reduces  to  (1)  when  =  ft,  Vi,  j.  Note  that  the  protocol  is 
synchronous,  but  the  randomness  in  the  network  update  is  induced 
by  packets  that  are  dropped  due  to  fading  or  collisions. 

In  matrix  form 

x(f  +  1)  =  W(f)x(t)  (2) 

where  matrix  W(f)  is  defined  by 

f  1  “  ft  i  =  j 

jsAfiit)  (3) 

I  0  o.w. 

Let  A(t)  denote  the  adjacency  matrix,  D(i)  the  diagonal  matrix 
of  in-degrees,  and  L(f)  :=  D(f)  —  A(f),  the  graph  Laplacian.  If 
hij  —  ft,  Vi,j,  then 

W(t)  =  I  -  ftL(f)  . 

If  links  are  bidirectional,  instantaneous  disagreement  can  be  repre¬ 
sented  in  terms  of  the  Laplacian,  as  shown  in  [3],  [4], 

n 

=  \xi{t)-Xj{t)f  . 

i=i  jeUi(t) 

Thus,  one  can  think  of  CA  as  a  steepest  descent  algorithm.  We 
do  not  assume  that  links  are  bidirectional;  hence  neither  A(t)  nor 
L(f)  is  assumed  to  be  symmetric. 

Note  that  this  formulation  does  not  preclude  simultaneous  trans¬ 
missions  or  receptions;  collisions  resulting  in  decoding  failures 
will  simply  not  be  used  in  the  update  equation:  the  corresponding 
Wij  =  0. 


II-B.  Assumptions  on  the  Fading 

We  will  consider  the  performance  of  our  consensus  algorithm  in 
(1),  which  we  will  call  CA,  under  one  of  two  different  assumptions 
on  the  fading: 

(Al)  All  transmissions  in  a  given  round  suffer  the  same  fade;  fading 
is  independent  from  round  to  round. 

(A2)  All  fades  are  iid  across  transmitters  and  across  rounds. 

Assumption  Al  is  reasonable  if  the  nodes  are  clustered  and  the 
time  for  a  full  round  of  transmission  is  smaller  than  the  channel 
coherence  time.  Assumption  A2  is  reasonable  when  the  nodes  are 
dispersed  or  the  coherence  time  is  small  compared  with  the  duration 
of  one  round  of  transmissions.  Note  that  we  have  not  assumed  any 
specific  distribution  (such  as  Rayleigh)  for  the  fading  under  A1-A2. 

We  do  not  need  explicit  assumptions  on  scheduling;  however, 
in  this  section,  we  assume  that  there  are  no  collisions  if  nodes 
transmit  simultaneously.  This  may  be  achieved  if  nodes  have  multi¬ 
packet  reception  capability  ,  or  proper  scheduling  is  performed.  We 
illustrate  the  impact  of  collisions  (in  an  Aloha-based  scheme)  in 
Example  6  in  Section  VII. 

Our  CA  in  (1)  is  not  sum  preserving: 

n  n  n 

^  a;i  (f  -f  1)  =  '^Xj{t)'^Wij{t) 

i=l  J  =  1  i=l 

j 

where  nj{t)  is  the  number  of  nodes  that  can  successfully  hear  j 
in  round  t,  i.e.,  the  out-degree  of  node  j,  and  recall  that  mj{t) 
is  the  in-degree.  In  general,  mi{t)  7^  ni{t),  although  clearly 
Under  assumption  Al,  the  fading  is 
fully  correlated  (i.e.,  all  transmissions  in  a  given  round  experience 
the  same  fade);  hence  mi{t)  =  m{t)  =  ni{t)\  and  CA  is 
sum-preserving.  Under  A2,  the  rriiit)  and  riiit)  are  independent 
random  variables.  From  the  update  equation  in  (1),  we  can  conclude 
immediately  that  W(t)l  =  1;  i.e.,  1,  the  n  x  1  vector  of  ones,  is 
always  a  right  eigenvector  of  W.  However,  l'W(f)  7^  V  unless 
the  CA  is  sum-preserving.  The  connectivity  graph  is  not  regular 
(i.e.,  all  nodes  do  not  have  the  same  degree),  and  the  in-degree 
does  not  equal  the  out-degree,  except  under  AL  Further  note  that 
communication  links  are  not  assumed  to  be  symmetric  under  A2. 

III.  RELATED  WORK 

There  has  been  a  resurgence  of  interest  in  characterizing  consen¬ 
sus  and  gossip  algorithms  following  the  seminal  work  of  Xiao  and 
Boyd  [5],  in  which  a  randomly  selected  pair  of  nodes  exchange 
and  update  their  estimates  at  any  given  time  slot.  Conditions 
for  convergence  in  mean  and  mean-square  are  derived  in  [5].  In 
Ay  sal  et  al  [6],  at  a  given  instant,  a  single  node  broadcasts  its 
estimates  and  all  nodes  in  its  one-hop  neighborhood  update  their 
estimates;  conditions  for  convergence  in  mean  and  mean  square 
are  derived,  and  update  parameters  are  optimized.  Instantaneous 
consensus  suffices  when  the  focus  is  not  on  parameter  estimation 
per  se  but  on  arriving  at  consensus  in  some  protocol:  e.g.,  clock 
synchronization,  allocation  of  TDMA  slots  or  hop  frequencies. 
Barbarossa  et  al  [7]  consider  the  consensus  problem  on  a  regular 
ring  topology  and  derive  convergence  conditions.  Vanka  et  al  [8] 
also  consider  the  problem  in  a  ring  topology  and  in  a  2-D  topology. 
In  the  preceding  papers,  a  static  AWGN  channel  is  assumed.  The 
average  consensus  problem  under  iid  link  and  node  failures  has 
been  studied  by  Barooah  et  al  [9],  [10].  Wang  and  Elia  [11] 
establish  conditions  for  convergence  when  the  link  drops  are  iid 
and  delays  are  link-dependent. 

If  the  communication  links  are  static,  the  update  algorithm  is 
typically  written  as  x(f  -F  1)  =  Wx(f)  (see  eq.  2),  where  x(f)  is 
the  vector  of  sensor  estimates  at  time  t.  It  is  reasonable  to  assume 
that  in  the  absence  of  fading,  the  underlying  graph  Qo  and  con¬ 
sensus  algorithm  W  achieve  consensus;  if  W  is  doubly  stochastic 


d{mo) 

dok 


(which  does  not  demand  symmetry),  then  average  consensus  can 
be  achieved.  Now  fading  causes  link  drops  in  Qo,  yielding  Q{t) 
and  hence  W(f).  Our  algorithm  in  (2)  is  the  biased  compensation 
method  of  Fagnani-Zampieri  [12].  Flowever,  their  results  are  not 
applicable  since  they  explicitly  invoke  the  assumption  that  fades 
are  iid  in  time  (at  each  t,  G{t)  is  drawn  independently)  and  across 
links;  the  latter  assumption  is  equivalent  to  assuming  that  fades 
between  any  pair  of  nodes  is  independent  of  all  others.  Pereira  et 
al  [13]  consider  a  special  case  of  [12]  (the  underlying  graph  is 
fully  connected;  the  iid  fade  process  yields  an  Erdos-Renyi  graph); 
their  weight  matrix  is  W(f)  =  1  —  hL{t),  where  L(f)  is  the 
graph  Laplacian.  Different  from  [12],  they  assume  that  the  initial 
measurement  vector,  x(0),  is  a  random  vector  whose  elements  are 
iid;  they  derive  an  expression  for  average  MSE  as  a  function  of 
iteration  index  k,  and  optimize  the  value  of  the  coupling  coefficient 
h.  Because  of  the  iid  link  activation,  i!i{W(f)}  is  symmetric  and 
doubly  stochastic.  Link  drop  models  are  also  considered  in  [14]; 
here  the  link  drops  are  independent;  the  drop  probability  can  be 
link  dependent,  but  links  are  assumed  to  be  symmetric. 

Noisy  receptions  over  random  topologies  are  studied  in 
[17].  Flere,  the  coupling  coefficients  (the  weights  for  “others- 
information”)  are  time- varying,  I/2-summable  but  not  Li- 
summable.  This  persistence  condition  is  required  to  ensure  that 
the  estimate  is  consistent.  The  link  failure  model  is  general:  the 
Laplacian  is  iid  in  time  and  the  mean  Laplacian  has  non-zero 
second  eigenvalue.  Link  failures  can  be  correlated  in  space.  A  signal 
fading  model  rather  than  a  link  drop  model  is  investigated  in  [18]. 
Flere,  the  received  signal  is  modeled  as  y{t)  =  gx{t)  -|-  i.e.,  a 

fading  additive  noise  channel.  The  fades  are  assumed  known  to  the 
receiver  and  x{t)  is  binary  (0,1).  Consensus  to  the  initial  majority 
decision  is  sought. 

We  consider  an  SNR  threshold  model;  a  link  is  ‘on’  if  the 
received  SNR  exceeds  a  prescribed  threshold.  The  design  for  a 
nominal  AWGN  channel  is  assumed  to  yield  a  connected  graph 
Qo  on  which  consensus  can  be  achieved  (although  note  that  what 
is  important  is  that  the  effective  graph  be  connected:  fading  could 
make  a  disconnected  graph  connected  sufficiently  often).  Fading 
induces  link  drops  which  are  correlated;  the  link  drop  probability 
is  link  dependent.  We  assume  that  the  link  drops  are  iid  in  time.  We 
do  not  assume  the  weight  matrix  W (f)  or  its  mean  to  be  symmetric. 

Consensus  over  fading  channels  and  the  issue  of  deterministic 
vs.  random  schedules  has  been  considered  in  [19]  and  [20].  We 
consider  an  Aloha  based  approach  and  compare  performance  to 
that  of  tree-based  approaches. 

Our  model  is  different  from  [12],  [13],  [16],  as  we  do  not  assume 
that  link  drops  are  iid  (across  links)  or  have  the  same  probability  of 
drop  for  every  link.  (If  the  nominal  design  range  is  do  and  path  loss 
factor  is  a,  then  a  transmission  to  a  node  d  away  is  successful,  i.e., 
the  link  is  on,  if  \g\^  >  {d/do)°‘;  thus  the  link  drop  probability 
depends  upon  the  link.  Further  the  link  drops  associated  with  a 
given  transmitter  are  correlated;  if  the  link  to  a  node  at  distance 
di  is  off,  links  to  all  nodes  farther  than  di  are  off;  and  if  the  link 
to  a  node  d2  away  is  on,  links  to  all  nodes  closer  than  d2  are 
on.  Even  if  the  fading  coefficient  g{-)  associated  with  each  link 
is  iid,  the  probability  of  a  link  drop  would  be  link  dependent  as 
in  [14]).  Different  from  [13]  we  assume  that  the  initial  observation 
vector  is  deterministic.  Different  from  [14],  [15],  we  do  not  assume 
symmetric  links  in  G{t).  Different  from  [17],  our  weighting  of 
others  information  does  not  decay  with  time  and  does  not  satisfy 
the  square-summability  condition.  Different  from  [18]  the  initial 
node  variables  are  not  restricted  to  be  binary.  Our  model  generalizes 
those  in  [6],  [7]  to  include  fading. 

IV.  CONVERGENCE  TO  INITIAL  MEAN  VALUE 

Let  W  :=  j5{W(f)}  where  the  expectation  is  wrt  the  fading 
process,  g{t),  equivalently,  the  effective  range  Vg.  Now,  from  (3), 
we  obtain 

(4) 


where  Pij  is  the  probability  that  node  j  can  successfully  transmit 
to  node  i.  Let  lij  denote  the  indicator  for  successful  transmission 


from  j  to  i.  Then,  for  i 

=  7- 

Wait)  = 

1  —  hmi{t)  =  1  — 

Wait)  = 

l-hY,E{Pj} 

= 

1  —  h  Pij  := 

h  Xij  (t) 


1  —  hrrii 


(5) 


where  mi  is  the  expected  number  of  updates  received  by  node  i 
(its  mean  in-degree). 

Note  that  W  and  W  are  right  stochastic  (row  sums  are  unity).  If 
the  channel  between  every  pair  of  nodes  is  stochastically  reciprocal, 
i.e.,  Pij  =  Pji,  then  W  is  symmetric  (see  (4))  and  hence 

doubly  stochastic,  i.e.,  '^f.Wkj  =  ~  If  ^ 

chosen  such  that  1  —  hrrii  >  0,  Vi,  and  if  the  fading  process  is 
such  that  Pij  >  0,  Vj  7^  i,  then  all  the  entries  of  W  are  positive. 
The  former  can  be  assured  by  choosing  h  <  1/n.  The  Perron- 
Frobenius  theorem  then  applies,  and  we  can  conclude  immediately 
that  the  maximum  eigenvalue  of  W  is  unique  and  is  given  by  1, 
and  that  the  absolute  value  of  the  other  eigenvalues  are  all  less 
than  unity.  We  can  verify  that  1,  the  n  x  1  vector  of  I’s  is  the 
eigenvector  corresponding  to  A  =  1.  Let  J  :=  — ll'.  It  follows 
that  the  eigenvalues  of  W  —  J  satisfy' 


A(W- J)|  <  l,Vfc  .  or  p(W- J)  <  1 


where  p(  )  denotes  the  spectral  radius. 

If  W  is  symmetric,  i.e.,  Pij  =  Pji,  then  the  conditions  W1  = 
1,  I'W  =  1',  p(W  — J)  <  1  hold  and  suffice  to  ensure  that  [5], [6, 
Prop  1] 

lim  x(f)|  =  Jx(0) 


Remarks: 

1)  We  have  only  made  mild  assumptions  about  the  fading.  To 
ensure  that  the  entries  of  W  are  positive,  we  assumed  that 
Pij  >  0,  Vi  7^  j.  If  this  condition  does  not  hold,  then  the 
entries  of  W  are  only  non-negative.  The  spectral  radius  of 
W  is  still  unity,  Ai  =  1^,  but  |A|  =  1  may  be  a  repeated 
root,  which  means  that  we  cannot  claim  p(W  —  J)  <  1. 
Rayleigh  fading  satisfies  the  assumption  Pij  >  0,  as  do  other 
fading  processes.  This  assumption  implies  that  there  is  a  non¬ 
zero  probability  that  any  two  nodes  can  communicate.  The 
assumption  Pij  >0,  Vi  7^  j  is  sufficient,  but  not  necessary, 
as  illustrated  by  Example  3  in  Section  VII. 

2)  We  also  assumed  that  Pij  =  Pji  i.e.,  the  fading  is  stochasti¬ 
cally  symmetric.  We  illustrate  via  Example  4  in  Section  VII 
that  this  assumption  is  not  necessary. 

3)  We  did  not  need  to  invoke  any  of  the  assumptions  Ai-A2 
on  the  joint  distribution  of  the  fading,  i.e.,  fading  may  be  iid 
partially  or  fully  correlated. 

4)  Under  the  stochastic  reciprocity  assumption,  Pij  =  Pji 
which  is  satisfied  by  our  model  under  (A2),  W  is  symmetric, 
even  though  W(f)  need  not  be  symmetric  VL  Let  Qw(f)  := 
nLiW(f).  Suppose  that  Q(f)  has  rank  1,  whence  it  can 
be  written  as  Q(f)  =  Iv'.  Since  W(t)  is  stochastic  Vf,  it 
follows  that  Q(f  H-  1)  =  W(f  -f  l)Q(f)  =  Iv'.  Thus  if 
Q(f)  is  rank  one  and  symmetric  at  some  t,  it  remains  so 
regardless  of  W(f  +  k),  k  >  0.  Our  conjecture  is  that  if 

'Details  omitted  due  to  lack  of  space 

^We  will  assume  that  the  eigenvalues  are  ordered  so  that  is  the 
eigenvalue  with  the  fc-th  largest  absolute  value 


Wij  :=  E{Wij{t)} 


hPij ,  i  ^  j 


W(l)  is  symmetric,  then  Q(i)  will  be  symmetric.  We  test 
this  in  Example  5  in  Section  VII. 

Thus,  we  have  established: 

Result  1:  The  consensus  algorithm  in  (1)  converges  to  the  average 
initial  value,  ii  h  <  1/n  and  the  fading  satisfies  Pij  =  Pji  >  0, 
Vi  /  j. 

V.  CONVERGENCE  IN  SECOND  MOMENT 

We  saw  that  CA  converges  in  the  mean;  in  this  section,  we 
will  establish  convergence  in  mean-square.  First  we  consider  con¬ 
vergence  to  the  instantaneous  average  x(f)  =  Jx(f).  Define  the 
instantaneous  error  as  x(f)  =  x(f)  —  x(f)  =  [I  —  J]x(f). 

If  Amax  (i?{W(t)'(I  —  J)W(t)})  <  1,  then  Lemma  3  of  [6] 
assures  us  that  CA  converges  in  mean-square: 

iJ{||x(f)[|i}^0. 

Our  CA  is  sum-preserving  under  Al;  and  Lemma  3  reduces  to 
A2(W)  <  1,  where  A2  denotes  the  second-largest  eigenvalue.  We 
verified  this  condition  in  the  previous  section.  Thus,  we  consider 
assumption  A2  first.  We  can  verify  that  the  conditions  of  [6, 
Lemma  3]  are  satisfied  by  the  matrix  W  in  (3),  and  establish  the 
following. 

Result  2:  Under  A2,  the  consensus  algorithm  in  (1)  converges  to 
the  instantaneous  average  in  mean  square,  if  /i  <  1/n  and 
Y  :=  £;{W'(f)JW(f)}  has  full  rank. 

We  believe  that  Y  would  have  full  rank  if  the  degree  distribu¬ 
tions  (the  distribution  of  is  identical  for  all  the  nodes. 

VI.  CENTRALIZED  VS.  DISTRIBUTED 

Even  for  a  static  topology,  we  only  have  asymptotic  convergence 
results.  To  measure  the  convergence  time  in  [21]  the  authors 
introduced  the  e-averaging  time  which,  in  settings  similar  to  ours, 
has  been  shown  to  grow  as  O(n^).  In  contrast,  one  can  create  a 
tree  in  a  decentralized  fashion,  pass  values  to  the  root  of  the  tree, 
and  then  distribute  the  consensus  value,  with  a  finite  number  of 
exchanges.  The  price  paid  is  clearly  that  of  finding  the  appropriate 
routing  tables.  One  should  be  able  to  characterize  the  expected 
latency  for  this  (at  least  via  bounds).  For  the  case  of  link  failures, 
one  can  compute  the  average  wait  time  (number  of  retransmissions 
required)  and  thus  bound  the  expected  latency. 

We  consider  a  simple  distributed  tree  algorithm  similar  to  ones 
that  have  been  used  for  network  time  synchronization  in  wireless 
sensor  networks,  based  on  the  fact  that  nodes  must  first  learn  the 
local  topology  before  a  distributed  tree  can  be  found.  Fading  is 
ignored  during  the  formation  of  the  tree.  The  root  node  broadcasts  a 
level  0  beacon;  one  hop  neighbors  then  take  turns  to  transmit  level 
1  beacons.  Nodes  that  have  already  heard  a  beacon  ignore  later 
ones.  It  is  clear  that  all  nodes  will  take  turns  to  transmit  beacons; 
thus  n  beacons  will  be  transmitted.  Further,  an  ACK  mechanism 
is  required  so  that  each  parent  knows  how  many  children  it  has; 
there  will  be  (n  —  1)  ACKs.  Thus  the  total  number  of  packets 
for  tree  formation  is  2n  —  1.  We  have  ignored  scheduling  issues. 
Each  node  will  transmit  a  packet  to  its  parent  that  contains  the 
average  value  of  the  incoming  data  and  its  in-degree.  ACKs  are 
assumed,  and  retransmits  will  occur  until  successful  transmission. 
If  p  denotes  the  probability  of  link  success,  then  the  average  number 
of  retransmits  is  1/p.  A  total  of  (n—1)  successful  transmissions  is 
required  for  aggregation  at  the  fusion  center.  With  Pmin  denoting 
the  probability  of  success  over  the  weakest  link,  an  upper  bound  on 
the  expected  number  of  packets  is  n  -|-  n/pmin,  making  the  usual 
assumption  that  ACKs  are  perfect.  The  root  will  then  compute  the 
average,  which  will  then  be  disseminated  down  the  tree.  Again 
ACKs  are  required  and  a  node  will  retransmit  packets  until  all  its 
children  can  successfully  decode  it.  The  average  number  of  packets 
can  be  upper-bounded  by  n  -f  n/pmin.  Thus  the  average  number 


of  packets  for  tree  formation,  aggregation,  and  dissemination  can 
be  upper-bounded  by  2n(2  +  1/pmin). 

VII.  SIMULATION  RESULTS 

Example  2:  We  implemented  the  consensus  algorithm  over  a 
2D  random  network,  with  n  =  100  nodes,  coupling  parameter 
h  =  path  loss  exponent  a  —  2,  nominal  transmission  range 
fo  =  ^0.1,  0.3,  0.5}.  Figure  1  shows  MSE  vs.  iterations  for  a 
Monte  Carlo  simulation  over  iT  =  50  topology  realizations.  As 
expected  convergence  is  faster  when  the  range  ro  is  larger,  i.e., 
when  the  network  is  more  connected. 

Example  3:  In  Section  IV,  we  showed  that  Pij  >  0,  Vi,j, 
is  a  sufficient  condition  for  convergence  in  the  mean.  However, 
this  is  not  a  necessary  condition  (see  Remark  1  in  Section  IV). 
To  demonstrate  this,  we  simulated  CA  across  a  ring  network. 
(There  is  no  compelling  reason  to  run  a  consensus  algorithm  on 
a  ring  network,  since  faster  convergence  to  average  consensus 
can  be  achieved  by  simple  token  passing  to  the  nearest  neighbor 
in,  say,  the  clockwise  direction.  We  use  a  ring  network  only  to 
obtain  insights.)  Figure  2  shows  the  empirical  log(MSE)  from 
a  Monte  Carlo  simulation  (K  =  50).  The  network  consisted  of 
nodes  uniformly  placed  on  a  ring,  with  parameters:  network  size 
n  =  100,  update  parameter  h  =  ^,  nominal  range  mo  =  20,  path 
loss  exponent  a  =  2  (black  curve),  a  =  6  (blue  curve).  The  red 
curve  corresponds  to  the  case  where  the  range  is  chosen  randomly 
over  Unif[0,  20].  This  corresponds  to  the  case  where  some  of 
the  Pij’s  are  zero  (see  Remark  1),  indicating  that  the  condition 
Pij  >  0  Vi,}  is  not  necessary.  As  expected,  convergence  is  slower 
when  the  path  loss  exponent  is  larger  (connectivity  is  weaker). 

Example  4:  This  example  verifies  Remark  2,  that  the  condition 
Pij  =  Pji,  'ii,j  is  only  sufficient,  not  necessary.  Figure  3  shows 
log(MSE)  vs.  iteration  for  a  2D  network  with  n  =  100,  a  =  3 
from  a  Monte  Carlo  simulation  {K  =  50).  Connectivity  within 
the  network  is  determined  by  P,  where  Pij  ~  Uni f  [0.01,0.70]. 
20%  of  the  off-diagonal  terms  (i.e.,  links)  were  turned  off  (set  to 
0).  The  figure  shows  that  the  consensus  algorithm  will  converge 
to  the  initial  mean  value.  The  case  where  Pij  ^  ?7m/[0.01,  0.70] 
(i.e.,  Pij  >  0)  is  also  shown  in  Figure  3.  Convergence  is  faster  in 
the  latter  case,  as  expected. 

Example  5:  This  example  tests  the  conjecture  in  Remark  4, 
that  if  W(l)  is  symmetric,  then  convergence  could  be  faster.  We 
implemented  the  consensus  algorithm  over  a  ring  topology,  with 
n  =  101,  a  =  3,  and  nominal  transmission  range  mo  =  10. 
Figure  4  shows  log(MSE)  vs.  number  of  iterations  for  a  Monte 
Carlo  simulation  (K  =  50).  In  this  example,  W(l)  is  forced 
to  be  symmetric  (Wij{l)  =  Wji{l));  in  subsequent  rounds, 
unidirectional  links  are  allowed,  i.e.,  W(f)  need  not  be  symmetric. 
Figure  4  shows  that  convergence  is  faster  than  in  the  baseline  case 
where  symmetry  is  not  enforced  on  W(l). 

Example  6:  In  this  example,  we  adopt  a  standard  slotted 
Aloha  protocol  for  medium  access.  As  before,  a  SNR  threshold 
is  assumed;  simultaneous  transmissions  may  or  may  not  lead  to 
successful  transmissions  in  this  case.  We  consider  the  performance 
of  the  consensus  algorithm  with  respect  to  p,  the  probability 
that  a  node  chooses  to  broadcast  in  each  time  slot.  Figure  5 
shows  the  number  of  time  slots  necessary  to  achieve  consensus 
within  an  error  threshold  r  vs.  the  Aloha  parameter  p  in  a  2D 
random  network  with  n  =  {50,  100,  200},  a  =  3,  ro  =  0.1, 
SINRt  =  2,  and  r  =  0.01.  Also,  each  simulation  is  run  for 
T  =  10000  time  slots  or  until  the  MSE  falls  below  the  threshold. 
The  dashed  line  shows  the  number  of  packets  that  are  successfully 
decoded  (by  at  least  one  node);  the  total  number  of  transmitted 
packets  can  be  estimated  as  pnT;  results  shown  are  averages 
across  K  —  100  realizations  of  the  topology  and  initial  values. 


For  larger  networks,  the  choice  of  p  becomes  critical  and  using 
a  slightly  larger  value  of  p  is  better  than  choosing  a  smaller 
value:  the  benefit  due  to  fading  outweighs  the  loss  due  to  collisions. 

Example  7:  We  compare  the  energy  consumption  of  the  con¬ 
sensus  algorithm  with  that  of  the  tree-based  approach.  Figure  6 
shows  the  total  number  of  transmitted  packets  required  to  achieve 
consensus  within  r  error  vs.  the  nominal  communication  range  ro. 
The  #  packets  required  for  the  tree-based  algorithm  are  shown,  as 
well  as  those  for  CA  with  perfect  scheduling  and  for  the  Aloha- 
based  approach.  Figure  7  shows  a  histogram  comparing  the  total 
number  of  transmitted  packets  for  the  case  where  ro  =  0.3. 
Parameters  were  p  —  0.02,  SINRt  =  2,  r  =  0.01,  n  =  100, 
K  =  100,  T  =  5000.  The  total  required  number  of  transmitted 
packets  for  the  consensus  approach  is  determined  by  scaling  the 
number  of  iterations  required  by  the  size  of  the  network  n, 
which  results  in  the  discrete  plot  entries.  The  apparent  discrete 
distribution  of  the  Aloha-based  approach  is  due  to  the  interval 
chosen  for  the  histogram.  It  is  seen  that  the  tree-based  approach 
consumes  less  energy  than  CA  when  the  nominal  range,  ro,  is  large 
enough  to  ensure  sufficient  connectivity  to  establish  a  tree.  When 
ro  falls  below  this  threshold,  the  tree-based  approach  fails,  but 
CA  can  still  achieve  consensus.  As  the  nominal  communications 
range  increases,  connectivity  improves  and  CA  requires  fewer 
packets,  and  so  does  the  Aloha-based  approach.  Notice  that  random 
scheduling  requires  about  twice  the  number  of  packets  as  does 
perfect  scheduling. 


Fig.  1.  Example  2:  MSE  vs.  time  for  a  random  2-D  network;  To  is 
the  nominal  transmission  range. 


VIII.  CONCLUSIONS 

We  studied  the  convergence  of  consensus  algorithms  when  links 
are  not  symmetric.  We  showed  that  some  gain  can  be  obtained 
by  selectively  choosing  bidirectional  links.  We  also  show  that  the 
link  probability  need  not  be  non-zero  in  order  for  the  consensus 
algorithm  to  converge.  We  compared  the  performance  of  an  Aloha- 
based  scheme  with  that  under  perfect  scheduling,  and  we  investi¬ 
gated  the  relative  performance  of  consensus-based  algorithms  with 
tree-based  algorithms  in  a  fading  environment.  We  showed  that 
the  tree-based  algorithms  can  incur  less  communications  cost  than 
consensus  algorithms  except  provided  that  the  nominal  communi¬ 
cations  range  is  large  enough  to  permit  the  establishment  of  a  tree. 
A  possible  extension  is  to  use  geographic  gossip  type  protocols, 
which  include  some  routing;  this  will  probably  exhibit  a  behavior 
which  mediates  between  the  robustness  of  consensus  and  the  speed 
of  the  tree  approach. 


Fig.  2.  Example  3:  MSE  vs.  time  for  a  ring  network 


Fig.  3.  Example  4:  MSE  vs.  time  for  a  2D  random  network  with 
asymmetric  link  probabilities  Pij;  the  solid  line  corresponds  to  the 
case  where  some  of  the  Pij  are  zero. 


Fig.  4.  Example  5:  MSE  vs.  time  for  a  ring  network.  In  Case  2, 
only  bidirectional  links  are  used  in  round  1,  so  that  W(l)  is  forced 
to  be  symmetric. 


Fig.  5.  Example  6:  Total  time  vs.  Aloha  transmission  probability  p 
for  a  2D  random  network;  n  is  the  number  of  nodes. 


Fig.  6.  Example  7:  Total  #  packets  vs.  nominal  range  for  different 
algorithms,  for  a  2D  random  network. 


Fig.  7.  Example  7:  Histogram  of  total  #  packets  for  tree-based 
approach,  consensus,  and  aloha-based. 
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